Cross-Modal Feature Representation Learning and Label Graph Mining in a Residual Multi-Attentional CNN-LSTM Network for Multi-Label Aerial Scene Classification
نویسندگان
چکیده
The results of aerial scene classification can provide valuable information for urban planning and land monitoring. In this specific field, there are always a number object-level semantic classes in big remote-sensing pictures. Complex label-space makes it hard to detect all the targets perceive corresponding semantics typical scene, thereby weakening sensing ability. Even worse, preparation labeled dataset training deep networks is more difficult due multiple labels. order mine visual features make good use label dependency, we propose novel framework article, namely Cross-Modal Representation Learning Label Graph Mining-based Residual Multi-Attentional CNN-LSTM (CM-GM framework). framework, residual multi-attentional convolutional neural network developed extract image features. Moreover, labels embedded by language model then form graph which be further mapped advanced (GCN). With these cross-modal feature representations (image, text), will enhanced aligned GCN-based embeddings. After that, signals fed into bi-LSTM subnetwork according built graph. CM-GM able map both graph-based correlated space appropriately, using dependency efficiently, thus improving LSTM predictor’s Experimental show that proposed achieve higher accuracy on many multi-label benchmark datasets remote field.
منابع مشابه
Deep learning for multi-label scene classification
Scene classification is an important topic in computer vision. For similar weather conditions, there are some obstacles for extracting features from outdoor images. In this thesis, I present a novel approach to classify cloudy and sunny weather images. Inspired by recent study of a deep convolutional neural network and the spatial pyramid matching, I generate a model based on the ImageNet datas...
متن کاملMulti-label Semantic Scene Classification
In classic pattern recognition problems, classes are mutually exclusive by definition. Classification errors occur when the classes overlap in the feature space. We examine a different situation, occurring when the classes are, by definition, not mutually exclusive. Such problems arise in semantic scene and document classification and in medical diagnosis. We present a framework to handle such ...
متن کاملMulti-Instance Multi-Label Learning with Application to Scene Classification
In this paper, we formalize multi-instance multi-label learning, where each training example is associated with not only multiple instances but also multiple class labels. Such a problem can occur in many real-world tasks, e.g. an image usually contains multiple patches each of which can be described by a feature vector, and the image can belong to multiple categories since its semantics can be...
متن کاملExploiting Associations between Class Labels in Multi-label Classification
Multi-label classification has many applications in the text categorization, biology and medical diagnosis, in which multiple class labels can be assigned to each training instance simultaneously. As it is often the case that there are relationships between the labels, extracting the existing relationships between the labels and taking advantage of them during the training or prediction phases ...
متن کاملFeature-aware Label Space Dimension Reduction for Multi-label Classification
Label space dimension reduction (LSDR) is an efficient and effective paradigm for multi-label classification with many classes. Existing approaches to LSDR, such as compressive sensing and principal label space transformation, exploit only the label part of the dataset, but not the feature part. In this paper, we propose a novel approach to LSDR that considers both the label and the feature par...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Remote Sensing
سال: 2022
ISSN: ['2072-4292']
DOI: https://doi.org/10.3390/rs14102424